1 GAATTCGCGGCCGCGTCGACAdATGCCTTCTTCTGCCTGAGATTACACCCCACTAGCCAA 60

6 1 CCACTTTTTGCCTTCGAATGGAGAGATCCAGGTACGGGAAGAACCGGGCAGCTCACCTGG 12 0 

121 ACCCGACTGCCCCAAGGGTTCAAGAACTCCCCGACCATCTTTGACGAAGCCCTACACAGG 13 0 

181 GACCTGGCCAACTTCAGGATCCAACACCCTCAGGTGACCCTCCTCCAGTACGTGGATGAC 2 4 0 

241 CTGCTTCTGGCGGGAGCCACCAAACAGGACTGCTTAGAAGGTACGAAGGCACTACTGCTG 3 0 0 

301 GAATTGTCTGACCTAGGCTACAGAGCCTCTGCTAAGAAGGCCCAGATTTGCAGGAGAGAG 3 6 0 

3 61 GTAACATACTTGGGGTACAGTTTGCGGGGCGGGCAGCGATGGCTGACGGAGGCACGGAAG 4 2 0 
421 AAAACTGTAGTCCAGATACCGGCCCCAACCACAGCCAAACAAGTGAGAGAGTTTTTGGGG 4 3 0 

4 81 ACAGCTGGATTTTGCAGACTGTGGATCCCGGGGTTTGCGACCTTAGCAGCCCCACTCTAC 5 4 0 

5 41 CCGCTAACCAAAGAAAAAGGGGGATTCTCCTGGGCTCCTGAGCACCAGAAGGCATTTGAT 60 0 
601 GCTATCAAAAAGGCCCTGCTGAGCGCACCTGCTCTGGCCCTCCCTGACGTAACTAA^CCC 650 

6 61 TTTACCCTTTATGTGGATGAGCGTAAGGGAGTAGCCCGAGGAGTTTTAACCCAAACCCTA 7 2 0 

7 21 GGAC CAT GG AGGAGAC CT GTT GC CT AC CT GT C AAAGAAG CTT GAT C C T GT AG C C AGT GGT 7 3 0 
7 31 TGGCCCGTATGTCTGAAGGCTATCGCAGCTGTGGCCATACTGGTCAAGGACGCTGACAAA 3 4 0 
3 4 1 TT GACTTT GGGACAG AAT AT AACT GT AAT AGCC C C C C AT G C ATT GG AG AACAT C GTT C GG 9 0 0 
9 01 CAGCCCCCAGACCGATGGATGACCAACGCCCGCATGACCCACTATCAA^GCCTGCTTCTC 9 50 
961 ACAGAGAGGGTCACTTTCGCTCCACCAGCCGCTCTCAACCCTGCCACTCTTCTGCCTGAA 102 0 

1021 GAG ACT GATGAACCAGT GACT CAT GATTGCCAT CAACT ATTG ATT GAGG AGACT GGGGT C 103 0 

10 81 CGCAAGGACCTTACAGACATACCGCTGACTGGAGAAGTGCTAACCTGGTTCACTGACGGA 114 0 

1141 AGCAGCTATGTGGTGGAAGGTAAGAGGATGGCTGGGGCGGCAGTGGTGGACGGGACCCGC 12 0 0 

1201 ACGATCTGGGCCAGCAGCCTGCCGGAAGGAACTTCAGCGCAAAAGGCTGAGCTCATGGCC 12 60 

12 61 CTCACGCAAGCTTTGCGGCTGGCCGAAGGGAAATCCATAAACATTTATACGGACAGCAGG 122 0 

1321 TATGCCTTTGCGACTGCACACGTACACGGGGCCATCTATAAACAAAGGGGGTTGCTTACC 13 3 0 

13 81 T CAGCAGGGAGGGAAATAAAGAACAAAGAGGAAATT CTAAGC CTATT AGAAG C CTTACAT 14 4 0 
1441 TTGC CAAAAAGGCTAGCTATT AT ACACTGT C CT GGACAT CAGAAAGC C AAAGAT CT CATA 15 0 0 
1501 TCTAGAGGGAACCAGATGGCTGACCGGGTTGCCAAGCAGGCAGCCCAGGCTGTTAACCTT 15 5 0 
15 51 CTGCCTATAATAGAAACGCCCAAAGCCCCAGAACCCAGACGACAGTACACCCTAGAAGAC 162 0 
1621 TGGCAAGAGATAAAAAAGATAGACCAGTTCTCTGAGACTCCGGAGGGGACCTGCTATACC 16 3 0 
1681 TCATATGGGAAGGAAATCCTGCCCCACAAAGAAGGGTTAGAATATGTCCAACAGATACAT 17 4 0 
17 41 CGTCTAACCCACCTAGGAACTAAACACCTGCAGCAGTTGGTCAGAACATCCCCTTATCAT 13 0 0 
13 01 GTTCTGAGGCTACCAGGAGTGGCTGACTCGGTGGTCAAACATTGTGTGCC CTGCCAGCTG 13 5 0 
13 61 GTTAATGCTAATCCTTCCAGAATACCTCCAGGAAAGAGACTAAGGGGAAGCCACCCAGGC 192 0 
1921 GCTCACTGGGAAGTGGACTTCACTGAGGTAAAGCCGGCTAAATACGGAAACAAATATCTA 19 8 0 
1981 TTGGTTTTTGTAGACACCTTT7CAGGATGGGTAGAGGCTTATCCTACTAAGAAAGAGACT 2 0 4 0 
2 041 TCAACCGTGGTGGCTAAGAAAATACTGGAGGAAATTTTTCCAAGATTTGGAATACCTAAG 2100 
2101 GTAATAGGGTCAGACAATGGTCCAGCTTTCGTTGCCCAGGTAAGTCAGGGACTGGCCAAG 2150 
2161 ATATTGGGGATTGATTGGAAACTGCATTGTGCATACAGACCCCAAAGCTCAGGACAGGTA 222 0 
2221 GAGAGGATGAATAGAACCATTAAAGAGACCCTTACCAAATTGACCACAGAGACTGGCATT 22 3 0 
22 81 AATGATTGGATGGCTCTCCTGCCCTTTGTGCTTTTTAGGGTGAGGAACACCCCTGGACAG 2 3 4 0 
2 3 41 TTTGGGCTGACCCCCTATGAATTGCTCTACGGGGGACCCCCCCCGTTGGCAGAAA.TTGCC 2 4 0 0 
2 4 01 TTTGCACATAGTGCTGATGTGCTGCTTTCCCAGCCTTTGTTCTCTAGGCTCAAGGCGCTC 2 4 60 
24 61 GAGTGGGTGAGGCAGCGAGCGTGGAAGCAGCTCCGGGAGGCCTACTCAGGAGGAGACTTG 2 52 0 
2521 CAAGTTCCACATCGCTTCCAAGTTGGAGATTCAGTCTATGTTAGACGCCACCGTGCAGGA 2580

2581 AACCTCGAGACTCGGTGGAAGGGACCTTATCTCGTACTTTTGACCACACCAACGGCTGTG 2 64 0 

2 641 AAAGTCGAAGGAATCCCCACCTGGATCCATGCATCCCACGTTAAGCCGGCGCCACCTCCC 27 0 0 

27 01 GATTCGGGGTGGAAAGCCGAAAAGACTGAAAATCCCCTTAAGCTTCGCCTCCATCGCGTG 2 7 60 

27 61 GTT CCTTACT CT GTCAATAACT C CT CAAGTT AAT GGT AAAC GC CTT GT G GACAGC C C G AA 2 8 2 0 
2 8 21 CTCCCATAAACCCTTATCTCTCACCTGGTTACTTACTGACTCCGGTACAGGTATTAATAT 2 8 8 0 

28 81 TAACAGCACTCAAGGGGAGGCTCCCTTGGGGACCTGGTGGCCTGAATTATATGTCTGCCT 2 94 0 

2 9 41 TCGATCAGTAATCCCTGGTCTCAATGACCAGGCCACACCCCCCGATGTACTCCGTGCTTA 3 0 0 0 
3001 CGGGTTTTACGTTTGCCCAGGACCCCCAAATAATGAAGAATATTGTGGAAATCCTCAGGA 3 0 60 

3 061 TTTCCTTT GCAAGCAATGGAGCTGCATAACTTCTAATGATGGGAATTGGAAA.TGGCCAGT 312 0 
3121 CTCTCAGCAAGACAGAGTAAGTTACTCTTTTGTTAACAATCCTACCAGTTATAAT CAATT 313 0 
3181 TAATTATGGCCATGGGAGATGGAAAGATTGGCAACAGCGGGTACAAAAAGATGTACGAAA 3 2 4 0 
32 41 TAAGCAAATAAGCTGTCATTCGTTAGACCTAGATTACTT AVUVATAAGTTTCACT AAAAA 3 3 0 0 
3 301 AAAAAAAAAAAAAAAAAAAA 3320 
1 TGTGGGCCCCAGCGCGCTTGGAATAAAAATCCTCTTGCTGTTTGCATCAAGACCGCTTCT 60

6 1 CGTGAGTGATTTGGGGTGTCGCCTCTTCCGAGCCCGGACGAGGGGGATTGTTCTTTTACT 120 

121 GGCCTTTCATTTGGTGCGTTGGCCGGGAAATCCTGCGACCACCCCTTACACCCGAGAACC 13 0 

181 GACTTGGAGGTAAAGGGATCCCCTTTGGAACGTGTGTGTGTGTCGGCCGGCGTCTCTGTT 24 0 

241 CTGAGTGTCTGTTTTCGGTGATGCGCGCTTTCGGTTTGCAGC7G7CCTCTCAGACCG7AA 3 0 0 

301 GGACTGGAGGACTGTGATCAGCAGACGTGCTAGGAGGATCACAGGCTGCCACCCTGGGGG 3 50 

3 61 ACGCCCCGGGAGG7GGGGAGAGCCAGGGACGCCTGGTGGTCTCCTACTGTCGGTCAGAGG 42 0 
421 ACCGAGTTCTGTTGTTGAAGCGAAAGCTTCCCCCTCCGCGGCCGTCCGACTCTTTTGCCT 4 3 0 

4 81 GCTTGTGGAAGACGCGGACGGGTCGCGTGTGTCTGGA7CTG7TGGTT7CTGTCTCGTGTG 5 4 0 
541 7 C777 G7 C77 G7 GCG7 CC77 G7 C7 ACAG7777 AA7 A7 G GGACAG AC AG7 G AC7 AC CZCZZ 6 0 0 
601 77AG777GAC7C7CGACCA77GGAC7GAAG77AGA7CCAGGGC7CA7AA7TTG7CAG77C 660 
661 AGGTTAAGAAGGGACCTTGGCAGAC77TCTG7GCC7C7GAA7GGCCAACA7TCGA7G77G 7 2 0 
7 21 GA7GGCCA7CAGAGGGGACC7T7AA77C7GAAA77A7CC7GGC7G77AAGGCAA7 CA777 7 3 0 
7 31 7T CAGACT GG ACCCGGCTCT CAT CCTGA7CAGGAGCCC7A7A7CC77 AC G7GGCAAGA77 8 4 0 
3 41 TGGCAGAAGA7CCTCCGCCATGGGTTAAACCATGGCTAAATAAACCAAGAAAGCCAGGTC 9 0 0 
9 01 CCCGAA7CC7GGC7C77GGAGAGAAAAACAAACAC7CGGCCGAAAAAG7CGAGCCC7C77 9 60 
961 CCTCG7A7C7ACCCCGAGA7CGAGGAGCCGCCGAC77GGCCGGAACCCCAACC7G77CCC 102 0 

1021 CCACCCCC77A7CCAGCACAGGG7GC7G7GAGGGGACC7C7GCCCC7CC7GGAGC7CCGG 108 0 
10 31 7 GG7GGAGGGACC7GC7GCCGGGAC7CGGAGCCGGAGAGGCGCCACCCCGGAGC GGACAG 114 0 
1141 ACGAGA7CGCGA7A7TACCGCTGCGCACC7A7GGCCC7CCCA7GCCAGGGGGCCAA77GC 12 0 0 
12 01 AGCCCC7CCAG7A77GGCCC7777C77C7GCAGA7C7C7A7AA77GGAAAAC7AACCA7C 12 60 
1261 CCCCTTTCTCGGAGGATCCCCAACGCCTCACGGGGTTGGTGGAGTCCCTTATGTTCTCTC 1320
1321 ACCAGCCTACTTGGGATGATTGTCAACAGCTGCTGCAGACACTCTTCACAACCGAGGAGC 13 3 0 

13 81 GAGAGAGAATTCTGTTAGAGGCTAGAAAAAATGTTCCTGGGGCCGACGGGCGACCCACGC 14 4 0 

14 41 AGTTGCAAAA7GAGATTGACATGGGATTTCCC7TGACTCGCCCCGGTTGGGACTACAACA 15 0 0 
1501 CGGCTGAAGGTAGGGAGAGCTTGAAAATCTATCGCCAGGCTCTGGTGGCGGGTCTCCGGG 15 50 
1561 GCGCCTCAAGACGGCCCACTAATTTGGCTAAGGTAAGAGAGGTGATGCAGGGACCGAACG 162 0 
1621 AACCTCCCTCGGTATTTCTTGAGAGGCTCATGGAAGCCTTCAGGCGGTTCACCCCTTTTG 16 3 0 
1631 ATCCTACCTCAGAGGCCCAGAAAGCC7CAGTGGCCC7GGCC77CA77GGGCAG7CGGC7C 17 4 0 
17 41 TGGATATCAGGAAGAAACTTCAGAGACTGGAAGGGTTACAGGAGGCTGAGTTACGTGATC IS 00 
1301 7 AG T G AG AGAG GC AG AG AAG G7 G 7 ATT ACAGAAGGGAGACAGAAG AG G AG AAGGAACAG A i860 
13 61 GAAAAGAAAAGGAGAGAGAAGAAAGGGAGGAAAGACG7GA7AGACGGCAAGAGAA.GAA77 192 0 
1921 TGACTAAGA7C77GGCCGCAG7GG77GAAGGGAAGAGCAGCAGGGAGAGAGAGAGAGA77 19 8 0 
19 31 77AGGAAAA77AGG7CAGGCCC7AGACAG7CAGGGAACC7GGGCAA7AGGACCCCAC7CG 2 0 4 0 
2 0 41 ACAAGGACCAG7G7GCG7A77G7AAAGAAAAAGGACAC7GGGCAAGGAAC7GCCCCAAGA 210 0 
2101 AGGGAAACAAAGGACCGAAG7CC7AGC7C7AGAAGAAGATAAAGA77AGGGGAGACGGG7 216 0 
2161 7CGGACCCCC7CCCCGAGCCCAGGG7AAC777GAAGGTGGAGGGGCAACCAG7TGAG77C 222 0 
2221 C7GG7TGA7ACCGGAGCGGAGCA77CAG7GC7GC7ACAACCA77AGGAAAAC7AAAAGAA 22 3 0 
22 81 AAAAAA7CC7GGG7GA7GGG7GCCACAGGGCAACGGCAG7A7CCA7GGAC7ACCCGAAGA 2 3 4 0 
2341 ACCG77GAC77GGGAG7GGGACGGG7AAGCCAC7CG777C7GG7CA7CCC7GAG7GCCCA 24 00 
2 4 01 G7ACCCC77C7AGG7AGAGAC77AC7GACCAAGA7GGGAGC7CAAA777C7777GAACAA 2 4 60 
2 4 61 GGAAGACCAGAAG7G7C7GTGAA7AACAAACCCA7CAC7G7G77GACCC7CCAAT7AGA7 2 52 0 
2521 GATGAATATCGACTATATTCTCCCCAAGTAAAGCCTGATCAAGA7ATACAGTCCTGGTTG 2580
25 81 GAGCAGTTTCCCCAAGCCTGGGCAGAAACCGCAGGGATGGGTTTGGCAAAGCAAGTTCCC 2 64 0 
2 641 CCACAGGTT ATTCAACT GAAGGC CAGT GCTACAC CAGT AT CAGT CAGACAGT ACC C CTT G- 27 0 0 
27 01 AGTAGAGAGGCTCGAGAAGGAATTTGGCCGCATGTTCAAAGATTAATCCAACAGGGCATC 27 60 
27 61 CTAGTTCCTGTCCAATCCCCTTGGAATACTCCCCTGCTACCGGTTAGGAAGCCTGGGACC 2 32 0 
2 321 AATGATTATCGACCAGTACAGGACTTGAGAGAGGTCAATAAAAGGGTGCAGGACATACAC 2 3 3 0 
23 81 CCAACGGTCCCGAACCCTTATAACCTCTTGAGCGCCCTCCCGCCTGAACGGAACTGGTAC 2 9 4 0 

29 41 ACAGTATT GGACTT AAAAGAT GC CTT CTT CT G C CT GAG ATT AC ACC C C ACT AGC CAAC C A 3 0 0 0 
3001 CTTTTTGCCTTCGAATGGAGAGATCCAGGTACGGGAAGAACCGGGCAGCTCACCTGGACC 3 0 60 

30 61 CGACTGCCCCAAGGGTTCAAGAACTCCCCGACCATCTTTGAC GAAGCCCTACACAGGGAC 312 0 
3121 CT G GCCAACTT CAGGAT C CAAC ACC CTCAGGTGACCCTCCTC CAGT AC GTGGATGACCTG 313 0 
2131 CTTCTGGCGGGAGC^CCAAACAGGACTGCTTAGAAGGTACGAAGGCACTACTGCTGGAA 32 4 0 

32 41 TTGTCTGACCTAGGCTACAGAGCCTCTGCTAAGAAGGCCCAGATTTGCAGGAGAGAGGT A 3 3 0 0 
3 3 01 ACATACTTGGGGTACAGTTTGCGGGGCGGGCAGCGATGGCTGACGGAGGCACGGAAGA^A 3 3 60 

33 61 ACTGTAGTCCAGATACCGGCCCCAACCACAGCCAAACAAGTGAGAGAGTTTTTGGGGACA 3 4 2 0 
3421 GCTGGATT7TGCAGACTGTGGATCCCGGGG7TTGCGACCTTAGCAGCCCCACTCTACCCG 3 4 3 0 
3 4 31 CTAACCAAAGAAAAAGGGGGATTCTCCTGGGCTCCTGAGCACCAGAAGGCATTTGATGCT 3 5 4 0 
3541 ATCAAAAAGGCCCTGCTGAGCGCACCTGCTCTGGCCCTCCCTGACGTAACTAAACCCTTT 3 600 
3 601 ACCCTTTATGTGGATGAGCGTAAGGGAGTAGCCCGAGGAGTTTTAACCCAAACCCTAGGA 3 660 
3 661 CCATGGAGGAGACCTGTTGCCTACCTGTCAAAGAAGCTTGATCCTGTAGCCAGTGGTTGG 37 2 0 
3721 CCCGTATGTCTGAAGGCTATCGCAGCTGTGGCCATACTGGTCAAGGACGCTGACAAATTG 37 30 
3781 ACTTTGGGACAGAATATAACTGTAATAGCCCCCCATGCATTGGAGAACATCGTTCGGCAG 3840

3 8 41 CCCCCAGACCGATGGATGACCAACGCCCGCATGACCCACTATCAAAGCCTGCTTCTCACA 3 9 0 0 

3901 GAGAGGGTCACTTTCGCTCCACCAGCCGCTCTCAACCCTGCCACTCTTCTGCCTGAAGAG 3 9 60 

3 961 ACT GAT GAAC CAGT GACT CAT GATT G C CAT CAACTATT GATT GAG GAGAC TGGGGTCCGC 4 02 0 

4 021 AAGGACCTTACAGACATACCGCT GACT GGAGAAGTGCT AAC CTGGTT CACT GACGGAAGC 4 0 3 0 
4 0 81 AGCTATGTGGTGGAAGGTAAGAGGATGGCTGGGGCGGCAGTGGTGGACGGGACCCGCACG 414 0 
4141 ATCTGGGCCAGCAGCCTGCCGGAAGGAACTTCAGCGCAAAAGGCTGAGCTCATGGCCCTC 4 2 0 0 
4201 AC G C AAG CTTT GC GG CT G GC C GAAG G GAAAT C C AT AAAC ATT TAT AC GG AC AG C AG GT AT 4 2 6 0 
4 2 61 GCCTTTGCGACTGCACACGTACACGGGGCCATCTATAAACAAAGGGGGTTGCTTACCTCA 4 3 2 0 
4 321 GCAGGGAGGGAAATAAAGAACAAAGAGGAAATTCTAAGCCTATTAGAAGCCTTACATTTG 4 3 3 0 
4 381 C CAAAAAGGCTAG CT ATT AT AC ACT GT C CT GGACAT CAG AAAGC CAAAGAT CT C AT AT CT 4 4 4 0 
4 4 41 AGAGGGAACCAGATGGCTGACCGGGTTGCCAAGCAGGCAGCCCAGGCTGTTAACCTTCTG 4 5 0 0 
4 501 C CT AT AAT AGAAAC G CC CAAAG C C C C AG AA.C C CAG AC G AC AGT AC AC C CT AG AAG ACT G G 4 5 50 
4 5 61 CAAGAGATAAAAAAGATAGACCAGTTCTCTGAGACTCCGGAGGGGACCTGCTATACCTCA 4 62 0 
4 621 TATGGGAAGGAAATCCTGCCCCACAAAGAAGGGTTAGAATATGTCCAACAGATACATCGT 4 6 8 0 
4 681 CTAACCCACCTAGGAACTAAACACCTGCAGCAGTTGGTCAGAACATCCCCTTATCATGTT 4 7 4 0 
4741 CTGAGGCTACCAGGAGTGGCTGACTCGGTGGTCAAACATTGTGTGCCCTGCCAGCTGGTT 4 8 00 
4 801 AATGCTAATCCTTCCAGAATACCTCCAGGAAAGAGACTAAGGGGAAGCCACCCAGGCGCT 4 3 60 
4 8 61 CACTGGGAAGTGGACTTCACTGAGGTAAAGCCGGCTAAATACGGAAACAAATATCTATTG 4 92 0 
4 921 GTTTTTGTAGACACCTTTTCAGGATGGGTAGAGGCTTATCCTACTAAGAAAGAGACTTCA 4 98 0 
4 981 ACCGTGGTGGCTAAGAAAATACTGGAGGAAATTTTTCCAAGATTTGGAATACCTA^GGTA 504 0 
5041 A7AGGG7CAGACAA7GG7CCAGC777CG77GCCCAGG7AAG7CAGGGAC7GGCCAAGA7A 5100
5101 TTGGGGATTGATTGGAAACTGCATTGTGCATACAGACCCCAAAGCTCAGGACAGGTAGAG 516 0 

5161 AGGATGAATAGAACCATTAAAGAGACCCTTACCAAATTGACCACAGAGACTGGCATTAAT 522 0 

5221 GATTGGATGGCTCTCCTGCCCTTTGTGCTTTTTAGGGTGAGGAACACCCCTGGACAGTTT 52 8 0 

5281 GGGCTGACCCCCTATGAATTGCTCTACGGGGGACCCCCCCCG7TGGCAGAAATTGCCTTT 5340

5341 GCACATAGTGCTGATCTGCTGCTTTCCCAGCCTTTGTTCTCTAGGCTCAAGGCGCTCGAG 5400

5401 TGGGTGAGGCAGCGAGCGTGGAAGCAGCTCCGGGAGGCCTACTCAGGAGGAGACTTGCA^ 5460

5461 GTTCCACATCGCTTCCAAGTTGGAGATTCAGTCTATGTTAGACGCCACCGTGCAGGAAAC 5520

5521 CTCGAGACTCGGTGGAAGGGACCTTATCTCGTACTTTTGACCACACCA^CGGCTGTGAAA 5580

5581 GTCGAAGGAATCCCCACCTGGATCCATGCATCCCACGTTAAGCYGGCGCCACCTCCCGAC 5640

5641 TCGGGGTGGAGAGCCGAAAAGACTGAGAATCCCC7TAAGCTTCGCGTCCATCGCCTGGTT 5700

5701 CCTTACTCTAACAATAACTCCCCAGGCCAGTAGTAAACGCCTTATAGACAGCTCGAACCC 5760

5761 C ^"-GACCTT7ATCCCTTACC7GGCTGAT?ATTGACCCTGATACGGGTG7CACTGTAAA 5820

5821 TAGCAC7C(aGG7GTTGC7CCTAGAGGCACCTGGTGGCCTGAACTGCATTTCTGCCTCCG 5880

5881 A77GA77AACCCCGC7G77AARAGCACACC7CCCAACCTAGTCCG7AGTTA7GGGT7C7A 5940

5941 T7GC7GCCCAGGCACAGAGAAAGAGAAA7AC7G7GGGGGT7CTGGGGAATCCTTCTGTAG 6000

6001 GAGA7GGAGC7GCGTCACC7CCAACGATGGAGACTGGAAA7GGCCGATCTCTCTCCAGGA 6060

6061 CCGGGTAAAA77C7CC777G7CAA7TCCGGCCCGGGCAAGTACAAAATGA7GAAACTA7A 6120

6121 TAAAGATAAGAGC7GC7CCCCA7CAGAC7TAGAT7ATCTAA.AGA7.AAGTT7CAC7GAAAG 6180

6181 GAAAACAGGAAAA7A77CAAAAG7GGA7AAATGG7ATGAGC7GGGGAA7AG777777ATT 6240

6241 A7A7GGCGGGGGAGCAGGG7CCAC7T7AACCA77CGCC77AGGA7AGAGACGGGGACAGA 6300

6301 ACCCCCTG7GGCAATGGGACCCGATAAAG7AC7GGC7GAACAGGGGCCCCCGGCCC7GGA 6360

6361 GCCACCGCATAACTTGCCGGTGCCCCAATTAACCTCGCTGCGGCCTGACATAACACAGCC 642 0 

6421 GCCTAGCAACAGTACCACTGGATTGATTCC7ACCAACACGCCTAGAAACTCCCCAGGTGT 6480 

6481 TCCTGTTAAGACAGGACAGAGACTCTTCAGTC7CATCCAGGGAGCTTTCCAAGCCATCAA 65 4 0 

6541 CTCCACCGACCCTGATGCCACTTCTTCTTGTTGGCTTTGTCTATCCTCAGGGCCTCCTTA 6600 

6601 TTAT GAGGGGAT GGCTAAAGAAAGAAAA77 CAAT GT GACCAAAGAGCA7 AGAAA7 CAAT G 6 6 6 0 

6661 TACA7GGGGG7CCCGAAA7AAGC7TACCC7CAC7GAAG777CCGGGAAGGGGACA7GCAT 6720 

6721 AGGAAAAGCTCCCCCATCCCACCAACACCTTTGCTATAGTACTGTGGTTTATGAGCAGGC 67 8 0 

6781 CTCAGAAAATCAGTATTTAGTACCTGGTTATAACAGGTGGTGGGCATGCAATACTGGGTT 5340 

6841 AACCCCCTGTGTTTCCACCTCAGTCTTCAACCAATCCAAAGATTTCTG7GTCATGGTCCA 5900 

6901 AA7CG7CCCCCGAG7G7AC7ACCA7CC7GAGGAAG7GG7CC77GA7GAA7A7GAC7ATCG 5 9 60 

6961 G7A7AACCGACCAAAAAGAGAACCCG7A7CCC77ACCC7AGC7G7AA7GC7CGGA7TAGG 7Q20 

7 021 GACGGCCG77GGCG7AGGAACAGGGACAGC7GCCC7GA7CACAGGACCACAGCAGC7AGA 7 0 3 0 

7 081 GAAAGGAC77GG7GAGC7ACA7GCGGCCA7GACAGAAGA7C7CCGAGCC77AAAGGAG7C 714 0 

7141 7G77AGCAACC7AGAAGAG7CCC7GAC77C777G7C7GAAG7GG77CTACAGAACCGGAG 7200 

7201 GGGA77AGA7C7GC7G777C7AAGAGAAGG7GGG77A7G7GCAGCC77AAAAGAAGAA7G 72 60 

7 261 77GC77C7A7G7AGATCAC7CAGGAGCCATCAGAGAC7CCATGAACAAGC7TAGA\AAAA 7 3 2 0 

7321 G77AGAGAGGCG7CGAAGGGAAAGAGAGGC7GACCAGGGG7GG777GAAGGA7GG77CAA 733 0 

7381 CAGG7C7CC77GGA7GACCACCC7GC7T7C7GC7C7GACGGGGCCCC7AG7AG7CC7GC7 74 4 0 

7 4 41 CC7G77AC7TACAG77GGGCC77GC77AA77AA7AGG777G77GCC777G77AGAGAACG 7 5 00 

7 501 AG7GAG7GCAG7CCAGA7CA7GG7AC77AGGCAACAG7ACCAAGGCC77C7GAGCCAAGG 7 5 50 
7561 AGAAACTGACCTCTAGCCTTCCCAGTTCTAAGATTAGAACTATTAACAAGACAAGAAGTG 7620

7621 GGGAATGAAAGGATGAAAATGCAACCTAACCCTCCCAGAACCCAGGAAGTTAATAAAAAG 7680 

7 681 CTCTAAATGCCCCCGAATTMCAGACCCTGCTGGCTGCCAGTAAATAGGTAGAAGGTCACA 7 7 4 0 

7741 CTTCCTATTGTTCCAGGGCCTGCTATCCTGGCCTAAGTAAGATAACAGGAAATGAG7TGA 7900 

7 3 01 -AATCGCTTATCTGGATTCTGTAAAACTGACTGGCACCATAGAAGAATTGATTACACAT 7 8 60 

7861 TGACAGCCCTAGTGACCTATCTCAACTGCAATCTGTCACTCTGCCCAGGAGCCCACGCAG 7920 

7 9 21 ATGCGGACCTCCGGAGCTATTTTAAAATGATTGGTCCACGGAGCGCGGGCTCTCGATATT 7 5 3 0 

7 931 - - AAAATGATTGGTCCATGGAGCGCGGGCTCTCGATATTTTAAAATGATTGGTTTGTGAC 8 0 4 0 

8041 GCACAGGCTTTG7TGTGAACCCCATAAAAGCTGTCCCGATTCCGCACTCGGGGCCGCAGT 8100 

8101 CCTCTACCCCTGCGTGGTGTACGACTGTGGGCCCCAGCGCGCTTGGAATAAAAATCCTCT 3150 
8161 TGCTGTTTGCATCAAAAAAAAAAAAAAAAAAAAAAA 8196 
1 G TGGTGTACGACTGTGGGCCCCAGCGCGCTTGGAATAAAAA7CCTCTTGCTGTTTGCATC 60
61 AAGACCGCTTCTCGTGAGTGATTTGGGGTGTCGCCTCTTCCGAGCCCGGACGAGGGGGAT 1 20 
12 1 TGTTCTTTTACTGGCCTTTCATTTGGTGCGTTGGCCGGGAAATCCTGCGACCACCCCTTA 1 3 0 
1 3 1 ^CCCGAGAACCGACTTGGAGGTAAAGGGATCCCCTTTGGAACGTGTGTGTGTGTCGGCC 2 4 0 

2 4 1 GGCGTCTCTGTTCTGAGTGTCTGTTTTCGGTGATGCGCGCTTTCGGTTTGCAGCTGTCC7 3 0 0 

3 0 1 CTCAGACCGTAAGGACTGGAGGACTGTGATCAGCAGACGTGCTAGGAGGATCACAGGCTG 3 5 0 
361 CCACCCTGGGGGACGCCCCGGGAGGTGGGGAGAGCCAGGGACGCCTGGTGGTCTCCTACT 420 
421 ^CGGTCAGAGGACCGAGTTCTGTTGTTGAAGCGAAAGCTTCCCCCTCCGCGGCCG-CCG 430 

4 3 1 ACTCTTTTGCCTGCTTGTGGAAGACGCGGACGGGTCGCGTGTGTCTGGATCTGTTGGTTT 5 4 0 

5 4 1 CTGTCTCGTGTGTCTTTGTCTTGTGCGTCCTTGTCTACAGTTTTAATATGGGACAGACAG 60 0 

MetGlyGlnThrV 

601 TGACTACCCCCCTTAG^^GAC ' ~ — 

°^ eui - r ^ euThri - eu AspHisTrpThrGluValArgSerArgAlaHi 5 A 

721 -0 

™ST-— ^ 340 

341 ^pG^nsri^ ^00 

9 . -- OG -yP«A-'gIxe^euAiaLeuGlyGluLysA 3 nLy S Hi^SerAlaGluLy S V 

^rS^^^ 1020 
coiecSe.SerTycLeuProArgAspArgGlyAlaAlaAspL^uAlaGIyThrP 

roTh.Cy S Ser?r=ThrProLeuSerSerThrGlyCysC y sGluGlyThrS e =Ala?r=P 
1081 ssss±kss^ -

1141 -o 

1201 '^S^S^ »«« 

1261 Jsssss™^^^ 

1321 TTATGTTCTCTCACCAGCCTACTTGGGATGATTGTCAACAQCrr'-'- Tr-jrj^r-, ' 

1381 CAACCGAGGAGCGAGAGAGAATTCTGTTAGAGGCTAGAAAAAATGT-rr^rrrr.— . ^ 

fc^luGluArgGluArgll.L^ 1440 

14 41 GGCGACCCACGCAGTTGCAAAATGAGATTGACATGGGATT7CCCTTGACTCGCC. - . n n 

lyAr gP roThrGlnLeuGlnA S nGluIleA S p Me ,Glv P heP^£;SL^gp:; G r;; i5 °° 

1501 »« 
1561 

1621 AGGGACCGA^CGAACCTCCCTCGGTATTTC^TGAGAGGC^-i-r--zi^^^^. 

1681 Jffi^^S^^S^^^ ™ 

1741 nssss? »°° 

1801 .... 

1921 ^AGAAGAATTTGACTAAGATCTTGGCCGCAGTGGTTGAAGGGAAGAGCArrArrr^^ 
ln0l ^y^ n L.«Thr l .y.Il.L.uAl^.v. 1 valGluG?S^S^5SS 

1981 ^G^*^^^a^^^^^"^^^^~^^^^^^^^^^^^^^^GGGAACCTGGGCAA' r A ^O.n 

-gGluArgAspPheArgLy.IleArgSecGlyProArgGlnSerGlyA^nLluG^, * ' ° 
2041 5S5S25SSS25^
2101 SSKSSSS^^ 

2221 -^11^™--- 2280 
2321 SS^^^^ss^^^^i 

2541 ^sssisis^ »oo 

2161 sxssssa^^ 

2821 sssssss^^ 

2881 s~ == c^^ 

2941 ASSESS— - 
3001 CCACTAGCCAACCACTTTTTGCCTTCGAATGGAGAGATCCAGGTACGGGAAGAACC-GC 3060
- "ThrSerGlnProLeuPheAlaPheGluTrpArgAspPcoGlvThr^r^hrG^G 

30 61 AGCTCACCTGGACCCGACTGCCCCAAGGGTTCAAGAACTCCCCGACCATC^TGAC-AA- ,i , rt 
lnLeuThrTrpThrArgLeuProGlnGlyPheLvsAsnSerProThrllLohlArpG^ 

3 12 1 CCCTACACAGGGACCTGGCCAACTTCAGGATCCAACACCCTCAGGTGACCCTCCT-AG- ^ a n 
laLeuH 1S ArgA SP LeuAlaA S nPheAr g lleGlnHi3ProGlnvIi;hrLluL^G?nT ° 

3181 ACGTGGATGACCTGCTTCTGGCGGGAGCCACCAAACAGGACTGCTTAGAA-GTArr- a ,-: 

^•^pAspl.uL.uL.uAl.GiyAl.ThrLy.Gl^^SSS^S^ 324 ° 

32 41 CACTACTGCTGGAATTGTCTGACCTAGGCTACAGAGCC^^GCTAAGAASecrra. ^ , 

33 01 GCAGGAGAGAGGTAACATACTTGGGGTACAGTTTGCGGGGCGGGCAGCGATGGC r' c „ -n 

ysArgArgGluValThrTycLeuGlyTyrSerLeuArgGlyGivGlSrSSL^^rG 

3361 AGGCACGGAAGAAAACTGTAGTCCAGATACCGGCCCCAAC'-AC^GCr».Ar- 

luAlaArqLvsi vs^hrValv,! r: ,-i . , ^^ UnWGC -' wAi:: '^^GA u AG 3420 
=^-9-ys-ys i nrvalValGj.r.ile?coAj.aPr3ThrThrAial.ysGir.ValArgG 

34 21 AGTTTTTGGGGACAGCTGGATTTTGCAGACTGTGGATCCCGGGGTTT GCGACCT ^ A , . . . 

luPhe^uGlyThrAlaGlyPheCysArgLeuTrpI^P^GlyPh^lSh^^A 

34 81 CCCCACTCTACCCGCTAACCAAAGAAAAAGGGGGATTCTCCTGGGCTCCrGAGCACC- - « - n 
^P'oLeuTyrProLeuThcLysGluLysGlvGlyPh^SerTrpAlaPriGluHTsGT^ 

354 1 ^^TTTGATGCTATCAAAAAGGCCCTGCTGAGCGCACCTGCTCTGGCC^—C-T-C- k„„ 
y^AlaPheA.pAlalleLysLyaAlaLeuLeuSerAiaProAlaLe^AlaE^P^^pv ° 

3601 TAACTAAACCCTTTACCCTTTATGTGGATGAGCGTAAGGGAGTAGCCCGAGGAGT— a a , ~„ 
alThrLy S P r oPheThrLeuTyrValA S? GiuArgLysGlyValS^rjG^ 3 °°° 

3651 h r GlnTh r ^e^r^o GG ' AGGA ' G ' A< "' ^ GTT GC CT ACCT GT CAAAGAAG CTT GAT C C~" 3 3 7 , 0 
hrGlnThrLeuGlyProTrpArgArgP = oVaiAlarvrLeuSerLy S I. ys LeuA S p ? r;v 

3721 a^SelS^TT^ 3780 
a-A.aSerGlyTrpProValCy S LeuLy S Ala:ieAiaAIaVaiAlaIleLeuVaILysA 

37 8 1 ACGCTGACAAATTGACTTTGGGACAGAATATAACTGTAATAGCCCCCCATGCAT-rr- - , «, , „ 
*P^^PLy«L.uThrL.uGlyGlnA a nIieThrVaili^SroSi A ^LSGtS 

3941 sSI f SITf G G ^ GCCCCCAGACCGATGGATGA ^^CGCCCGCATGACCCACTATC^ 3900 
S nIleValAr g GlaProProAs P ArgTrpMerThrA S aAlaArgMerThrH iS TyrGT^5 

3 901 CCCTGCTTCTCACAGAGAGGGTCACTTTCGCTCCACCAGCCGCTCTCAACCCTGCCAC 
erLeuLeuLeuThrGluArgVaiThrPheAlaProProAlaAlaLeuAsnPr^la^ 



■CTC 2960 
ThrL 
3961 TTCTGCCTGAAGAGACTGATGAACCAGTGACTCATGATTGCCATCAACTATTGAT7GAGG 4020
euLeuProGluGluThrAspGluProValThrHisAspCysHi^GlnLeuLeuIleGluG 

4 021 AGACTGGGGTCCGCAAGGACCTTACAGACATACCGCTGACTGGAGAAGTGCTAACCTGGT 4 0 8 0 
luThrGlyValArgLysAspLeuThrA^plleProLeuThrGIyGiuValLe^hrTrpP 

4 081 TCACTGACGGAAGCAGCTATGTGGTGGAAGGTAAGAGGATGGCTGGGGCGGCAGTGGTGG 4 14 0 
heThrAspGlySerSerTyrValValGluGlyLysArgMetAlaGIyAlaAiaValValA 

4141 ACGGGACCCGCACGATCTGGGCCAGCAGCCTGCCGGAAGGAACTTCAGCGCAAAAGGC7G 4,nn 
spGlyThrArgThrlleTrpAlaSerSerLeuProGluGlyThrSerAlaG^JAV^ 

4201 AGCTCATGGCCCTCACGCAAGCTTTGCGGCTGGCCGAAGGGAAATCCA T AAAC A.f"T" r ^"" A 4 ? S n 
lu.euMetAlaL.uThrGinAlaLeuArgLeuAlaGluGlyLysSeSi^SieTy;? 

hrA^pSerArgTy-^aPheAlaThrAla.-sValH^GlyAialleTyrLysGlnArgG 

4 321 ^TTGCTTACCTCAGCAGGGAGGGAAATAAAGAACAAAGAGGAAATTCT^GCCTATTAG 43 3 0 
lyLeuLeuThrSerAlaGlvArgGIuIleLysAsnLysGluGiuIleLeuSerLeuL^G 

4381 ^:TI^ AT T TG r ; ^ GGCTAGCTAT ^A=ACTGTCCTGGACATCAGAAAGCCA 4 4 4 0 
^uAlaLeuHisLeuProLysArgLeuAiallelleHisCysProGlyHisGlnLysAlaL 

4441 ^Spflu ^I^f^f^^ 2 ^ 4500 
ysAspKeuUeSerArgGlyAsnGlnMecAIaAspArgValAlaLysGlnAlaAlaGlnA 

4 501 CT J^CCTTCTGCCTAT-AATAGAAACGCCCAAAGCCCCAGAACCCAGACGACAGTACA 4 5 6 0 
laValA S n i ,euLeu?r = Il a TieGluThr ? roLy 3 Ala?roGlu? : oArgA.-gGlnTy=T 

4561 h llluT^ GAC l GGC -^ GA ; AAAAAAGATAGACCAG 7 7C7C7GAGAC7CCGGAGGGGA 4520 
hrLeuGluAs P 7rpGinGlaIleLy S Ly S Il e AspGlnPh^SarGIuThrPr=GluGlyT 

4621 w ^ GC I ATA ^ CTCA7ATGGGAAGGA ^TCC7GCCCCACAAAGAAGGG77AGAA7ATG7CC 4 63 0 
hrCys T yr7hrS e r7yrGly L y S GluIleL e u P roH i sLy S GluGlyL e uGluTyrValG 

4581 ^GtST^f TCT ^ CCCACCTAGGAA ^AAACACC7GCAGCAG7TGG7CAGAACA7 4 7 4 0 
InGinllsHxsArgLeuThrHisLeuGlyThrLysHisLeuGlnGlnLeuVaiArgThrS 

474 1 ero^^^f/Tr GAGGCTACCAGGAGTGG ^GAC7CGG7GG7CAAACA7TGTG7GC 4800 
er.roXyrKisValLeuArgLeuProGlyValAlaAspSerValValLysHisCysValP 

4801 -oSs Gt G ^ G 5 T T^ TG r^ T f TTCCAGAATAC ^<=CAGGAAAGAGAC7A^GGGGAA 4860 
-oCysGlnLeuValAsnAlaAsnProSerAcgllePcoProGiyLysArgLe^rgGlyS 

4861 GCCACCCAGGCGC7CAC7GGGAAG7GGAC77CAC7GAGG7AAAGCCGGCXAAA7ACGGAA 4 92 0 
erHxsProGlyAlaHi S 7rpGluValA S pPhe7hrGluValLys? C oAlaL^TyrGlyJ 
4921 ACAAATATCTATTGGTTTTTGTAGACACCTTTTCAGGATGGGTAGAGGCTTATCCTACTA 4980
snLysTyrLeuLeuValPheValAspThrPheSerGlyTcpValGluAlaTyrProThrl 

4 981 AGAAAGAGACTTCAACCGTGGTGGCTAAGAAAATACTGGAGGAAATTTTTCCAAGATTT-. , n 4 0 
ysLysGluThrSerThrValValAlaLysLysIleLeuGluGluIlePheProArgP^G 

5041 GAATACCTAAGGTAATAGGGTCAGACAATGGTCCAGCTTTCGTTGCCCAGGTAAGTCAGG 5100 
lylleProLysVallleGlySerAspA^nGlyProAlaPheValAiaGlaValSeSnG 



S160 



5101 GACTGGCCAAGATATTGGGGATTGATTGGAAACTGCATTGTGCATACAGACCCCAAAGC- 
lyLeuAlaLysIleLeuGlylleAspTrpLysLeuHisCysAlaTyrArgProGlr.SerS 

5161 CAGGACAGGTAGAGAGGATGAATAG.^ACCATTAAAGAGACCCTTACCAAATTGACCACAG - 2 2 0 
erGlyGlnValGiuArgMetAsnArgThrllsLysGluThrLeuThrLysLeuThrThrG 

luThrGIylleAsnAspTrp.MecAIaLeuLeuProPheValLauPheArgValArgAsr- 

5281 h - tl^ A ^ GT J TGGGC7GACCCCCTA ^^ 5 3 4 0 

h-ProGlyGlnPheGlyLeuThrPcoTyrGluLeuLauTyrGlyGlyProProProLeuA 

laGluIxeAxaPheAlaHisSerAlaAspValLeuLeuSecGlnProLeuPheSesArgL 

54 01 TCAAGGCGCTCGAGTGGGTGAGGCAGCGAGCGTGGAAGCAGCTCCGGGAGGCCTACTCAG 54 6 0 
««Ly,Al«L.uGiuTrpValArgGlnArgAlaTrpLy S GlnL.uArgGluAlaTy=S«G 

5451 w^C^T^^f/; 00 ^ 55^0 

lyGlyAspLeuGlnValProHisArgPheGlnVaiGlyAspSerVaiTyrValAxgArgh- 

5521 ACCGTGCAGGAAACCTCGAGACTCGGTGGAAGGGACCTTA7CTCGTACTTTTGACCACAC 55 3 0 
isArgAlaGlyAsnLeuGiuThrArgTrpLysGlyPrcTviLeuVaiLeuLeuT^Thr? 

^^T^f 70 ^ 5640 
-QThrAxaValLysValGluGlyrie?:=ThrTrpIlaHi S AiaS e rHisValLy S ProA 

MecHis PrcThrLeuSarArg 

5541 ? G p^ CT ^ CCGACTCGGGGTGG - G ^^GA^GAcTGAGAATCCCCTT.VAGCTTCGCC 5700 
laProProProAspSerGlyTrpArgAlaGluLysThrGiuAsnProLeuLvs^uAr- 
ArgHisLeuPrcThrArgGlyGlyGiuProLysArgLeuArgll^PrcLe^rt^n 

3701 ;S^f C r G P7 TCCT7AC7C7AAC ^ TAAC7CCCCAGGC ^ 5760 
suHisArgLeuVaiProTyrSerAsnAsnAsnSerProGlvGinrr.d 

SerlleAlaTrpPhaLeuThrLeuThrllsThrProGlr^laSerSetLysArgLeuIle 

57 61 GACAGCTCGAACCCCCATAGACCTTTATCCCTTACCTGGCTGATTATTGACCCTG^TACG 5 e - 0 
^PSerS e rA,n? = oHi S Ar5ProL«uSerLeuThrTr?L*uIIeIle Aa pProX«Thr 
5821 GGTGTCACTGTAAATAGCACTCGAGGTGTTGCTCCTAGAGGCACCTGGTGGCCTGAACTG 5880
GlyValThrValAsnSerThrArgGlyValAlaProArgGlyThrTrpTrpProGTu^u 

588! CATTTCTGC^CCGATTGA^ 

HisPheCysLeuArgLeuIleAsnProAlaValLysSerThrProProAsnLeuValArg 

594! AGTTATGGGTTCTATTGCTGCCCAGGCACAGAGAAAGAGAAATACTGTGGGGGTTCTGGG 6000 
* SerT y rG1 y PheT y rC y sC y^«GlyThrGluLy S GiuLy S TyrCysGlyG ly SerGly 

5001 Q^^e^Bh^^^^^'^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ 1 '^'^^^^^ 6060 

GluS«rPh«CyiAx5ArgTrpSerCysValThrSerAsnAapGlyAapTr P LysTcpPra 

6061 ATCTCTCTCCAGGACCGGGTAAAATTCTCCTTTGtC.=ATTCCGGCCCGGGCAAG~ACAAA 5 1 ■> n 
xieSerLeuGlnAsoArgVa-ysPheSerPheValAsr.SerGlyProGly^ysTyr^ 

6121 :ISIr^ TA r T ^ GAT ^ GAGCTGC ^ 513 0 

MecMetLysLauTycLysAspLysSerCysSerProSerAspLeuAspTyrLeuLyslie 

5181 Jf 3 1?^!^=?^^ 62 4 0 

Se.PheThrGluArgLysThrGiyLysTyrSecLysValAspLysTrpTyrGluLeuGly 

624 1 AATAGTTTTTTATTATATGGCGGGGGAGCAGGG7CCACTTTAACCATTCGCCTTAGGATA 6300 
AanS.rPh.I.auL.uTyrGlyGlyGlyAlaGlySesTh.-X.euThrll.ArgLe^rglli 

uluThrGlyThrGluProProValAlaMecGlyProAspLysValLeuAiaGluGlnGly 

5361 =: G ;: GGCCCTG = AGGCACCGCAT ^ CTTGC 64 ■> 0 

P-oProAlaLeuGluProProHisAsr.LauProVair.-oGinLeuThrSerLeuArgPro 

A^T^^f G f TAGCAACAGTACCACTGGA -^TTCCTACCAACACGCCTAGA 64 8 0 
Aspile.hrGlnProProSerAsnSerThrThcGlyLeuIieProThrAsnThrProArg 

6481 JACTCCCCAGGTGTTCCTGTTAAGACAGGACAGAGACTCTTCAGTCTCATCCAGGGAGCT 6540 
AsnSerProGlyValProValLysThrGiyGlnArgLeuPheSerLeuIleGlnGlyAla 

6541 ^CCAAGCCATCAACTCCACCGACCCTGATGCCACTTCTTCT-GTT GGCTT T G ^c^A— - --no 
^•Gli^.Ile^nSerThr^pProA.pM.ThrSarSerCy^^^^^s.r ° 

6601 If AG ? GGCTGCT ; AT ^ A?GAGGGGATGGC ^AAAGAAAGA^ 6560 
-e.^lyProProTyrTyrGluGlyMa^AlaLysGIuAraLysPheAsnValThrLysGlu 

6661 CATAGAAATCAATGTACATGGGGGTCCCGAAATAAGCTTACCCTCACTGAAGTTTCCGGG 672 0 
riisArgAsnGlnCysThrT.-pGlySerArgAsnLysLeuThrLeuThrGluValSerGly 

6721 AAGGGGACATGCATAGGAAAAGCTCCCCCATCCCACCAACACCTTTGCTATAGTACTGTG 57 8 0 
Ly»GIyThrC ya il.GlyLvsAl«PcoProSerHi5GlnKi»L«uCy a TyrSecTSI5IJ 
6781 ;iTr T ^ G ^ GG f CT ^ GA ^ TCAGTATTTAGTACCTGGTTAT ^CAGGTGGTGGGCA 6840
ValTyrGluGlnAlaSerGluAsnGlnTyrLeuValProGlyTyrAsnArgTrpTrpAia 

68 41 TGCAATACTGGGTTAACCCCCTGTGTTTCCACCTCAGTCTTCAACCAATCCAAAGATTTC 6900 
CysAsnThrGlyLeuThrProCysValSerThrSerValPheAsnGlnSerLysAsp^a 

690! TGTGTCATGGTCCAAATCGTCCCCCGAGTGTACTACCATCCTGAGGAAGTGGTCCTTGAT 6 9 6 0 
Cy S ValMetValGlnlleVal ProArgVaiTyrTyrHis ProGluGIuValValLeuAsp 

6951 s AA ^I GAC ^ AT f GGTATAACCGACCAAAA ^ 70?0 
GluTyrAspTyrArgTyrAsnArgProLysArgGluProValSerleuThrLeuAIaVal 

7021 ATGCTCGGATTAGGGACGGCCGTTGGCGTAGGAACAGGGACAGCTGCCCTGATCACAGG^ 7080 
MetI.euGxyLeuGlyThrAlaValGiyValGlyThrGlyThrAIaAIaX.euIleThrGry 

7081 CCACAGCAGCTAGAGAAAGGACTTGGTGAGCTACATGCGGCCATGACAGAAGATCTCCG^ 7 M 0 
--oGlnGln^euGluLysGlyLeuGlyGIuLeuHisAlaAiaMecThrGluAspLe^aArg 

7141 GCCTTAAAGGAGTCTGTTAGCAACCTAGAAGAGTCCCTGACTTC-TTGTCTGAAGTGGTT 72 0 0 
AlaLeuLysGluSerValSerAsnLeuGluGluSerLeuThrSacLeuSerGluValvil 

7201 CTACAGAACCGGAGGGGAT7AGATCTGCTGTTTCTAAGAGAAGGTGGGTTATGTGCAGCC 7? 60 
LeuGlnAsnArgArgGlyLeuAspLeuLeuPheLeuArgGluGlyGlyLeuCysAiaAia 

7 261 JTAAAAGAAGAATGTTGCTTC7ATG-AGATCACTCAGGAGCCATCAGAGACTCCATGAAC 7 3 ? 0 
ueu^ysGluGluCysCysPheTyrValAspHisSerGlyAialieA.-gAspSerMe-ATn 

7 321 AAGCTTAGAAAAAAGTTAGAGAGGCGTCGAAGGGAAAGAGAGGCTGACCAGGGGTGGT7T 7 3 8 0 
-ysx. e -uArgLysLysLeuGl'aA.rgArgArgArgGluArgGluAiaA S pGlnGlyTr?Phs 

7331 G tJ? G ;I! G rT C t GGTaTCCTTGGATGACCACC ™ 7440 
G - jG -y--PSfneAsnArgSerPcoTrpMecThrThrL«uLeuS*rAlaLeuThrGiyPro 

7441 ^SltfJ^f^^ 7500 
LeuValValLauLeuLeuLeuLeuThrValGlyProCysLeuIlaAsaArgPheValAla 

7501 =hIS; 7 ^^f/^ 7 5 6 0 

.neValArgGxuArgValSerAlaValGlnllaMecValLeuArgGinGlnTyrGlnGIy 

7361 L^lIuJf GA * GGAGAAACTGACC ^^^ 7520 
i-eui-euSerGlnGlyGluThrAspLau£nci 

7621 CAAGACAAGAAGTGGGGAATGAAAGGATGAAAATGCAACCTAACCCTCCCAGAACCCAGG 7580 
7581 AAGTTAATAAAAAGCTCTAAATGCCCCCGAATTACAGACCCTGCTGGCTGCCAGTAAATA 774 0 
7741 GGTAGAAGGTCACACTTCCTATTGTTCCAGGGCCTGCTATCCTGGCCTAAGTAAGATAAC 7800

7 801 AGGAAATGAGTTGACTAATCGCTTATCTGGATTCTGTAAAACTGACTGGCACCATAGAAG 7 3 60 

7861 AATTGATTACACATTGACAGCCCTAGTGACCTATCTCAACTGCAATCTGTCACTCTGCCC 7920 

7 921 AGGAGCCCACGCAGATGCGGACCTCCGGAGCTATTTTAAAATGATTGGTCCACGGAGCGC 7 98 0 

7 981 GGGCTCTCGATATTTTAAAATGATTGGTCCATGGAGCGCGGGCTCTCGATATTTTAAAAT 8 0 4 0 

8 041 GATTGGTTTGTGACGCACAGGCTTTGTTGTGAACCCCATAAAAGCTGTCCCGATTCCGCA 810 0 
8 101 CTCGGGGCCGCAGTCCTCTACCCCTGCGTGGTGTACGACTGTGGGCCCCAGCGCGCTTGG 3160 
3161 AATAAAAATCCT CTTGCT GTTT GC AT CAAAAAAAAAAAAAAAAAAAAAA 320 9 
Figure 4.

The nucleotide sequence of this Figure is the same as that represented
by bases 5260 to 8210 in Figure 3, with the following changes:





Position   Change

5273       G-T
5341       C-T
5351       C-T
5353       T-C
5356       C-T
5426       G-A
5464       Insertion AGA
5607       C-T
5638       C-T
5792       T-C
6191       Insertion AA
6253       T-A
6255       Insertion A
6900       C-G
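
The position/change notation in the table above can be applied mechanically to a reference sequence. The following Python sketch is illustrative only. It assumes that an entry such as `G-T` means the reference base G is replaced by T, that an insertion is placed immediately after the stated position, and that every position refers to the numbering of the unmodified sequence. The function name `apply_changes`, the tuple layout and the toy sequence are hypothetical and are not taken from the Figures.

```python
def apply_changes(seq, changes, first_position=1):
    """Apply substitutions and insertions to a nucleotide sequence.

    seq            -- reference sequence as a string
    changes        -- list of (position, kind, value) tuples, where kind is
                      "sub" (value like "G-T") or "ins" (value like "AGA")
    first_position -- number assigned to the first base of seq
    """
    bases = list(seq)
    # Work from the highest position down so that an insertion never
    # shifts the coordinates of a change that has yet to be applied.
    for pos, kind, value in sorted(changes, key=lambda c: c[0], reverse=True):
        i = pos - first_position
        if not 0 <= i < len(bases):
            raise ValueError(f"position {pos} lies outside the sequence")
        if kind == "sub":
            ref, alt = value.split("-")
            if bases[i] != ref:
                raise ValueError(f"expected {ref} at {pos}, found {bases[i]}")
            bases[i] = alt
        elif kind == "ins":
            # Assumption: inserted bases follow the stated position.
            bases[i + 1:i + 1] = list(value)
        else:
            raise ValueError(f"unknown change type: {kind}")
    return "".join(bases)


# Toy sequence numbered from 1; not a sequence from the Figures.
print(apply_changes("ACGTACGT", [(2, "sub", "C-T"), (4, "ins", "AGA")]))
# prints ATGTAGAACGT
```

The reference-base check is a deliberate choice: it raises an error when the numbering offset is wrong, instead of silently altering the wrong base.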





These nucleotide changes result in the following amino acid
changes in the ENV polypeptide:



Position   Change

7          R-W
192        R-K
193        Deletion
194        Deletion
197        Y-Q
198        S-E
199        K-N
200        V-I
201        D-Q
204        Y-I
205        E-N
206        Insertions: G,M,S
206        L-W
208        N-I
209        S-V
211        L-Y
212        L-K
427        F-L
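
The link between a nucleotide substitution and an entry such as `R-W` can be illustrated by translating both versions of a reading frame and comparing them residue by residue. The sketch below is a minimal example under stated assumptions. It uses the standard genetic code and handles only length-preserving substitutions, not the insertions or deletions listed above. The names `translate` and `amino_acid_changes` and the toy sequences are hypothetical and are not taken from the Figures.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (TTT, TTC, TTA, ...).
_BASES = "TCAG"
_AMINO = ("FFLLSSSSYY**CC*W"
          "LLLLPPPPHHQQRRRR"
          "IIIMTTTTNNKKSSRR"
          "VVVVAAAADDEEGGGG")
CODON_TABLE = {"".join(c): aa
               for c, aa in zip(product(_BASES, repeat=3), _AMINO)}


def translate(dna):
    """Translate a DNA reading frame; unknown codons become 'X'."""
    dna = dna.upper()
    usable = len(dna) - len(dna) % 3  # ignore a trailing partial codon
    return "".join(CODON_TABLE.get(dna[i:i + 3], "X")
                   for i in range(0, usable, 3))


def amino_acid_changes(ref_dna, alt_dna):
    """List (position, 'X-Y') pairs where the two translations differ."""
    ref, alt = translate(ref_dna), translate(alt_dna)
    if len(ref) != len(alt):
        raise ValueError("only length-preserving changes are supported")
    return [(pos, f"{r}-{a}")
            for pos, (r, a) in enumerate(zip(ref, alt), start=1)
            if r != a]


# Toy frame: a single C-to-T change turns codon CGG (R) into TGG (W).
print(amino_acid_changes("ATGCGGGAA", "ATGTGGGAA"))
# prints [(2, 'R-W')]
```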
Figure (number illegible in source)

Viruses shown: SFV-3, SFV-1, HSRV, PoEV, GaLV, BaEV, FeLV, MuLV, MMTV, MPMV, EIAV, HIV-1, RSV


MuLV   murine leukaemia virus
FeLV   feline leukaemia virus
GaLV   gibbon ape leukaemia virus
SFV-1  simian foamy virus 1
SFV-3  simian foamy virus 3
HSRV   human foamy virus
BLV    bovine leukaemia virus
HTLV   human T-cell leukaemia virus
MMTV   murine mammary tumour virus
MPMV   Mason Pfizer monkey virus
RSV    Rous sarcoma virus
FIV    feline immunodeficiency virus
HIV    human immunodeficiency virus
EIAV   equine infectious anaemia virus
PPT | U3 OCT-1 c-Myb LyF-1 E47

A A G AAGTGGGGA A.TGA A&GGAT GAAAAT GCAACCTAACCJZTCC CAGAA.CC 



ETS-1 AP- 4 

51 CAGGAAGTTA ATAAAAAGCTCTAAATGCCCCCGAAT TMCAGACCCTGCTG

NF-1 AP-1/TR 

101 C-CTGCCAGTAAATA GGT AGAAGGTCACA CTTCCTATTGTTCCAGGGgCXG 

ETS-1/GATA GATA ETS-1 c-Myb AP-1 GATA 

151 CT ATC CTGG CC TAAGT AAGATAACA GGAAATC-AGT T G AC TAA TCGC T TAT

E47 AP-1 

201 C T GGAT T C T G T AAA AC T GAC T GGC A C C AT AGAA.GAAT T GAT T ACACATTG

AP-1 AP-1 /GATA C-Myb AP-1 

251 ACAGCCC T AGTGACCTATCTCAACTGCAATCTGTCACT CTGCCCAGGAGC 

E47 ETS-1 CCAAT 

301 C CACGC AGATGCGGAC CTC CGGAGC TA TTT TAAAATG ATTGG TCCACGGA 

GATA CCAAT<€- 

351 GCGCGGGCTCTCGA TATTT TAAAATGATTGGTCCATGGAGCGCGGGCTCT 

GATA CCAAT<€- AP-1/CREB 

401 CGATATTTTAA AAT GATTGG TT TGTGACGCACAG GCTTTGTTGTGAACCC 

TATA U3 | R

451 CATAAAAGCTGTCC CGATTCCGCACTCGGGGCCGCAGTCCTCTACCCCTG 

PADS PolyA

501 CGTGGTGTA CGACTGTGGGCCCCAGCGCGCTTGGAATjjAAAATCCTCTTG 

R | U5

551 CTGTTTGCATCAAGACCGCTTCTYGTGAGTGATTTGGGGTGTCGCCTCTT 

U5 | PBS

601 CCGAKCCCGGACGAGGGGGATTGTTC7TTTACTGGCCTTTCATTTGG_TGC



651 GTTGGCCGGGAAATCCTGCGACC



